[Plasmid construction diagram. Labels in reading order: Ligation; HindIII; BamHI; pBMesterase1a; Conversion of BamHI to SmaI; EcoRI; EcoRV/PstI; HindIII; SmaI; pBMesterase1b]
[Plasmid construction diagram. Labels in reading order: SmaI/EcoRV esterase fragment from pBMesterase3; EcoRV; EcoRI; PstI; BamHI; PstI; HindIII; SmaI; BamHI; SmaI; EcoRI; HindIII; 3.8 kb EcoRV/SmaI fragment inserted into SmaI site of pSJC62; HindIII; SmaI; pSJC62.3]

FIG. 4
[Plasmid construction diagram. Labels in reading order: MscI; NcoI; BamHI; BamHI; SmaI; BamHI cleavage, Klenow, MscI cleavage, 1.2 kb fragment gel purified; BamHI; NcoI; Ligation; NcoI cleavage, BamHI cleavage; NcoI; Ligation; SmaI cleavage, BAP treated; pSJC62.3; NcoI; PCR amplification, Rc2a+ primer, Rc4- primer; BamHI; Esterase; Esterase PCR, NcoI/BamHI cleavage; BamHI; pBMesterase11]
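
The cleavage steps named in the diagrams depend on where each enzyme's recognition sequence occurs. As a minimal sketch (not part of the original drawings), the following Python lists the standard recognition sequences of the enzymes named above and scans a sequence for them. The function name `find_sites` and the choice of test input (the first line of the genomic sequence in FIG. 9A) are assumptions made for illustration only.

```python
# Standard recognition sequences (5'->3') of the enzymes named in the diagrams.
# All eight are palindromic, so scanning one strand is sufficient.
SITES = {
    "BamHI": "GGATCC",
    "EcoRI": "GAATTC",
    "EcoRV": "GATATC",
    "HindIII": "AAGCTT",
    "MscI": "TGGCCA",
    "NcoI": "CCATGG",
    "PstI": "CTGCAG",
    "SmaI": "CCCGGG",
}


def find_sites(seq: str) -> dict[str, list[int]]:
    """Return 0-based start positions of every recognition sequence found in seq."""
    seq = seq.upper()
    hits: dict[str, list[int]] = {}
    for enzyme, site in SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)  # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits


# First 60 nucleotides of the genomic sequence as printed in FIG. 9A.
genomic_start = "GGATCCACCCGAACTCTGTCCCGCTTTCTGGCTTTCTTCCTTGCTGTCGCCCCATCGCCT"
print(find_sites(genomic_start))
```

Positions are reported on the given strand only; this sketch does not model cut positions, overhangs, or methylation sensitivity.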
N-TERMINAL AMINO ACID SEQUENCE

AMINO ACID SEQ.      T    N    P    N    E    P
REV. TRANSLATION     ACX  AAPy CCX  AAPy GAPu CC
INVERSE              GGPy TCPu TTX  GGPu TTX  GT

PROBE 1              GGPy TCPu TTG  GGPu TTX  GT
PROBE 2              GGPy TCPu TTA  GGPu TTX  GT
PROBE 3              GGPy TCPu TTT  GGPu TTX  GT
PROBE 4              GGPy TCPu TTC  GGPu TTX  GT

Four 17-mer oligonucleotide probes, each with 32-fold degeneracy, were
synthesized from the N-terminal amino acid sequence and used to probe a
Southern blot of R. toruloides DNA.
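
The probe design above can be checked with a short calculation. The sketch below is an illustration, not the original authors' procedure. It rewrites the figure's Py, Pu and X as the IUPAC codes Y, R and N (an assumed notation change), takes the reverse complement of the reverse translation, and fixes the first fully degenerate position to each of the four bases to obtain four probe pools. The names `reverse_complement` and `degeneracy` are hypothetical helpers.

```python
# IUPAC codes used here: R = purine (A/G), Y = pyrimidine (C/T), N = any base.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT", "N": "ACGT"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "R": "Y", "Y": "R", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Reverse-complement a sequence written with the IUPAC codes above."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def degeneracy(seq: str) -> int:
    """Number of distinct plain sequences represented by a degenerate oligo."""
    count = 1
    for base in seq:
        count *= len(IUPAC[base])
    return count


# Reverse translation of T N P N E P, truncated to 17 nt as in the figure.
coding = "ACNAAYCCNAAYGARCC"
inverse = reverse_complement(coding)

# Fix the first N (index 8) to G, A, T, C in turn to get probes 1 to 4.
probes = [inverse[:8] + base + inverse[9:] for base in "GATC"]

for number, probe in enumerate(probes, start=1):
    print(number, probe, len(probe), degeneracy(probe))
```

Splitting one position across four pools keeps each pool at 2 x 2 x 2 x 4 = 32 sequences instead of a single pool of 128.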
RHODOSPORIDIUM ESTERASE cDNA

ATGCTCCTTAACCTCTTCACCCTCGCCTCCCTCGCTGCGACGCTCCAGCTCGCCTTTGCC 70 
MLLNLFTLASLAATLQLAFA

TCTCCGACCTCCCTCGTCCGCCGCACGAACCCAAACGAGCCCCCTCCCGTCGTCGACCTC 130 
SPTSLVRRTNPNEPPPVVDL

GGCTACGCCCGCTACCAAGGCTACTTGAACGAGACCGCCGGACTCTACTGGTGGCGCGGA 190 
GYARYQGYLNETAGLYWWRG

ATCCGCTACGCCTCGGCTCAGCGCTTCCAGGCTCCTCAGACGCCCGCGACGCACAAGGCC 250 
IRYASAQRFQAPQTPATHKA

GTCCGCAACGCGACTGAGTATGGACCGATCTGTTGGCCGGCTAGCGAGGGAACCAACACG 310 
VRNATEYGPICWPASEGTNT

ACCAAGGGCTTGCCGCCGCCTAGCAACAGCTCGAGCAGCGCGCCGCAGAAACAGGCGTCG 370 
TKGLPPPSNSSSSAPQKQAS

GAGGATTGCCTCTTCCTCAATGTCGTTGCCCCCGCCGGCTCGTGCGAGGGCGACAATCTT 430 
EDCLFLNVVAPAGSCEGDNL 

CCCGTCCTCGTCTACATTCACGGAGGTGGCTACGCCTTCGGCGATGCGAGCACCGGCAGC 490
PVLVYIHGGGYAFGDASTGS

GACTTTGCCGCCTTCACCAAGCACACGGGAACCAAGATGGTCGTTGTAAATCTCCAGTAC 550 
DFAAFTKHTGTKMVVVNLQY

CGTCTCGGCAGCTTTGGTTTCCTCGCTGGCCAAGCCATGAAGGACTACGGTGTAACGAAC 610 
RLGSFGFLAGQAMKDYGVTN

GCCGGCTTGCTTGACCAGCAATTCGCCCTTCAATGGGTTCAACAGCACGTCTCGAAGTTC 670 
AGLLDQQFALQWVQQHVSKF 

GGCGGCAACCCCGATCACGTTACGATTTGGGGCGAGTCTGCAGGCGCAGGGTCCGTTATG 730 
GGNPDHVTIWGESAGAGSVM

AACCAGATCATTGCGAACGGCGGCAACACCGTCAAGGCTCTCGGTCTCAAGAAGCCCCTC 790 
NQIIANGGNTVKALGLKKPL

TTCCACGCTGCCATCGGCTCCTCCGTCTTCCTCCCCTACCAAGCCAAGTACAACTCCCCC 850 
FHAAIGSSVFLPYQAKYNSP

TTCGCCGAGCTGCTCTACTCCCAACTCGTCTCGGCGACAAACTGCACCAAAGCCGCCTCG 910 
FAELLYSQLVSATNCTKAAS 

TCCTTCGCTTGCCTCGAAGCTGTCGACGCTGCGGCGCTCGCTGCGGCGGGCGTGAAGAAC 970 
SFACLEAVDAAALAAAGVKN

TCGGCGGCGTTCCCGTTCGGGTTTTGGTCGTATGTCCCGGTCGTCGACGGGACCTTCTTG 1030
SAAFPFGFWSYVPVVDGTFL

FIG. 8A 
ACTGAGCGCGCGTCGCTCCTTCTCGCCAAGGGCAAGAAGAACCTCAATGGCAACCTCTTC 1090
TERASLLLAKGKKNLNGNLF

ACCGGGATCAACAACCTCGACGAAGGATTCATATTCACTGACGCCACTATTCAGAACGAC 1150
TGINNLDEGFIFTDATIQND

ACGATCAGCGACCAGTCGCAGCGCGTCTCCCAGTTCGACCGCCTCCTCGCCGGCCTCTTC 1210
TISDQSQRVSQFDRLLAGLF

CCCTACATCACCTCGGAGGAGCGCCAGGCCGTCGCGAAGCAGTACCCGATCTCCGACGCG 1270
PYITSEERQAVAKQYPISDA

CCGTCAAAGGGCAACACCTTCTCTCGCATCTCGGCCGTCATCGCGGACTCGACCTTCGTC 1330
PSKGNTFSRISAVIADSTFV

TGCCCGACCTACTGGACCGCCGAGGCGTTCGGCTCGTCCGCCCACAAGGGCCTCTTCGAC 1390
CPTYWTAEAFGSSAHKGLFD

TACGCGCCGGCTCACCACGCGACCGACAACTCGTACTACATCGGCTCCATCTGGAACGGC 1450
YAPAHHATDNSYYIGSIWNG

AAGAAGTCGGTCTCGTCCGTCCAGTCCTTCGACGGCGCGCTCGGCGGCTTCATCGAGACG 1510
KKSVSSVQSFDGALGGFIET

TTCAACCCGAACAACAACGCTGCCAACAAGACCATCAACCCTTACTGGCCGACGTTCGAC 1570
FNPNNNAANKTINPYWPTFD

TCGGGCAAGCAGCTCCTCTTCAACACGACGACGAGGGACACCCTCTCTCCCGCCGACCCG 1630
SGKQLLFNTTTRDTLSPADP

CGCATCGTTGAGACTTCAAGCTTGACCGACTTTGGCACGAGCCAGAAGACCAAGTGCGAC 1690
RIVETSSLTDFGTSQKTKCD

TTCTGGCGTGGGTCAATCTCGGTGAACGCGGGTCTC 1726
FWRGSISVNAGL

FIG. 8B
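
The pairing of each nucleotide line with the amino acid line beneath it follows the standard genetic code. As a small illustrative sketch (the helper name `translate` and the compact codon-table construction are assumptions, not taken from the figures), the first 60 nucleotides of the cDNA in FIG. 8A can be translated and compared with the printed residues:

```python
from itertools import product

# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the cDNA as printed in FIG. 8A.
first_line = "ATGCTCCTTAACCTCTTCACCCTCGCCTCCCTCGCTGCGACGCTCCAGCTCGCCTTTGCC"
print(translate(first_line))
```

Running the same check line by line is a quick way to spot OCR damage in either the nucleotide or the amino acid rows.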
GGATCCACCCGAACTCTGTCCCGCTTTCTGGCTTTCTTCCTTGCTGTCGCCCCATCGCCT 60

|-- Translation Start -->
TTCCCGACTCGCCGCCATGCTCCTTAACCTCTTCACCCTCGCCTCCCTCGCTGCGACGCT 120

MLLNLFTLASLAATL



|-- Mature peptide -->
CCAGCTCGCCTTTGCCTCTCCGACCTCCCTCGTCCGCCGCACGAACCCAAACGAGCCCCC 180
QLAFASPTSLVRRTNPNEPP

TCCCGTCGTCGACCTCGGCTACGCCCGCTACCAAGGCTACTTGAACGAGACCGCCGGACT 240
PVVDLGYARYQGYLNETAGL

CTACTGGTGGCGCGGAATCCGCTACGCCTCGGCTCAGCGCTTCCAGGCTCCTCAGACGCC 300 
YWWRGIRYASAQRFQAPQTP 

CGCGACGCACAAGGCCGTCCGCAACGCGACTGAGTATGGACCGATCTGTTGGCCGGCTAG 360
ATHKAVRNATEYGPICWPAS

CGAGGGAACCAACACGACCAAGGGCTTGCCGCCGCCTAGCAACAGCTCGAGCAGCGCGCC 420
EGTNTTKGLPPPSNSSSSAP

GCAGAAACAGGCGTCGGAGGATTGCCTCTTCCTCAATGTCGTTGCCCCCGCCGGCTCGTG 480
QKQASEDCLFLNVVAPAGSC

CGAGGGCGACAATCTTCCCGTCCTCGTCTACATTCACGGAGGTGGCTACGCCTTCGGCGA 540 
EGDNLPVLVYIHGGGYAFGD 

TGCGAGCACCGGCAGCGACTTTGCCGCCTTCACCAAGCACACGGGAACCAAGATGGTCGT 600 
ASTGSDFAAFTKHTGTKMVV 

TGTAAATCTCCAGTACCGTCTCGGCAGCTTTGGTTTCCTCGCTGGCCAAGCCATGAAGGA 660 
VNLQYRLGSFGFLAGQAMKD 

[ Intron #1 

CTACGGTGTAACGAACGCCGGCTTGCTTGACCAGGTGAGTTTCCCGCATGATACCCGCCC 720 
YGVTNAGLLDQ 

ACCTTTCGACTCATGCTGACGCCTCTCCCGCTC^ 780 

QFALQWVQ 

ACAGCACGTCTCGAAGTTCGGCGGCAACCCCGATCACGTTACGATTTGGGGCGAGTCTGC 840 
QHVSKFGGNPDHVTIWGESA 

[ Intron #2 

AGGCGCAGGGTCCGTTATGAACCAGATCATTGCGAACGTGAGCCACCCGAACCGATCTCC 900
GAGSVMNQIIAN 

AGCCGACTTTCCCCCCCCCCCCCCCCCCGCTGAC^^ 9 60 

GGNT

CCGTCAAGGCTCTCGGTCTCAAGAAGCCCCTCTTCCACGCTGCCATCGGCTCCTCCGTCT 1020
VKALGLKKPLFHAAIGSSVF

TCCTCCCCTACCAAGCCAAGTACAACTCCCCCTTC^ 1080 
LPYQAKYNSPFAELLYSQLV 



FIG. 9A 
CTGCGGCGCTCGCTGCGGCGGGCGTGAAGAACTCGGCGGCGTTCCCGTTCGGGTTTTGGT 1200
AALAAAGVKNSAAFPFGFWS 

CGTATGTCCCGGTCGTCGACGGGACCTTCTTGACTGAGCGCGCGTCGCTCCTTCTCGCCA 1260
YVPVVDGTFLTERASLLLAK

[ Intron #3

AGGGCAAGAAGAACCTCAATGGCGTGCGTGGCGAGCTTTCGAGTGCTTCAGGATCTCGCT 1320 
GKKNLNG



GACACTGTCGACCGGCTCGCAGAACCTCTTCACCGGGATCAACAACCTCGACGAAGATGA 1380

NLFTGINNLDEG 

Intron #4 ] 

GTTCCCGTCGACGGCTCTGTTCGCCCAGCGAGACTGACTTGTTCTTTTGCGAAGATTACG 1440 

ATTCATATTCACTGACGCCACTATTCAGAACGACACGATCAGCGACCAGTCGCAGCGCGT 1500
FIFTDATIQNDTISDQSQRV 

CTCCCAGTTCGACCGCCTCCTCGCCGGCCTCTTCCCCTACATCACCTCGGAGGAGCGCCA 1560 
SQFDRLLAGLFPYITSEERQ

GGCCGTCGCGAAGCAGTACCCGATCTCCGACGCGCCGTCAAAGGGCAACACCTTCTCTCG 1620
AVAKQYPISDAPSKGNTFSR

[ Intron #5

CATCTCGGCCGTCATCGCGGACTCGACCTTCGTGTGCGTTCCCCGTCGTCTTCTCCGAGT 1680 
ISAVIADSTFV

]

ATTCCGCTGACTTCCCGCTTGCCCGCAGCTGCCCGACCTACTGGACCGCCGAGGCGTTCG 1740

CPTYWTAEAFG 

GCTCGTCCGCCCACAAGGGCCTCTTCGACTACGCGCCGGCTCACCACGCGACCGACAACT 1800 
SSAHKGLFDYAPAHHATDNS

CGTACTACATCGGCTCCATCTGGAACGGCAAGAAGTCGGTCTCGTCCGTCCAGTCCTTCG 1860 
YYIGSIWNGKKSVSSVQSFD 

ACGGCGCGCTCGGCGGCTTCATCGAGACGTTCAACCCGAACAACAACGCTGCCAACAAGA 1920 
GALGGFIETFNPNNNAANKT

CCATCAACCCTTACTGGCCGACGTTCGACTCGGGCAAGCAGCTCCTCTTCAACACGACGA 1980 
INPYWPTFDSGKQLLFNTTT 

CGAGGGACACCCTCTCTCCCGCCGACCCGCGCATCGTTGAGACTTCAAGCTTGACCGACT 2040 
RDTLSPADPRIVETSSLTDF 

TTGGCACGAGCCAGAAGACCAAGTGCGACTTCTGGCGTGGGTCAATCTCGGTGAACGCGG 2100 
GTSQKTKCDFWRGSISVNAG 

GTCTCTAGGCGTCTTTCCTTCCGACTTCCTTCGTTCTTTCGTTGTTTATTCTTGCAGTTC 2160 
L * 

CGTTGTATCGGCCATTCGTGCGTGTAGCTCACTCGAGTATAGACGTTGGCAAGTGCGAAA 2220 

FIG. 9B 
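
Comparing the genomic sequence of FIGS. 9A and 9B with the cDNA of FIGS. 8A and 8B shows where each intron lies. The following is a naive sketch of that comparison for Intron #5, using the genomic lines numbered 1680 and 1740 and the matching stretch of cDNA. The function name `locate_intron` is hypothetical, and the longest-prefix/longest-suffix approach is an assumption chosen for brevity; it handles only a single intron and can place a boundary a few bases off when exon and intron ends share bases.

```python
def locate_intron(genomic: str, cdna: str) -> tuple[int, int]:
    """Return (start, end) of the one genomic segment that is absent from cdna."""
    # Longest shared prefix is taken as the upstream exon.
    prefix = 0
    while prefix < len(cdna) and genomic[prefix] == cdna[prefix]:
        prefix += 1
    # Longest shared suffix is the downstream exon, capped so exons do not overlap.
    suffix = 0
    while (suffix < len(cdna) - prefix
           and genomic[-1 - suffix] == cdna[-1 - suffix]):
        suffix += 1
    if prefix + suffix != len(cdna):
        raise ValueError("cDNA is not the genomic sequence minus one segment")
    return prefix, len(genomic) - suffix


# Genomic lines 1680 and 1740 (FIG. 9B), joined.
genomic = (
    "CATCTCGGCCGTCATCGCGGACTCGACCTTCGTGTGCGTTCCCCGTCGTCTTCTCCGAGT"
    "ATTCCGCTGACTTCCCGCTTGCCCGCAGCTGCCCGACCTACTGGACCGCCGAGGCGTTCG"
)
# Corresponding stretch of cDNA (FIG. 8B, lines 1330 and 1390).
cdna = (
    "CATCTCGGCCGTCATCGCGGACTCGACCTTCGTC"
    "TGCCCGACCTACTGGACCGCCGAGGCGTTCG"
)

start, end = locate_intron(genomic, cdna)
intron = genomic[start:end]
print(start, end, intron)
```

For this example the excised segment begins with GT and ends with AG, the usual intron boundary dinucleotides.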
TRN 2-1738 RHODOSPORIDIUM ESTERASE cDNA
Translation Start    Mature Peptide

MLLNLFTLASLAATLQLAFASPTSLVRRTNPNEPPPVVDLGYARYQGYLNETAGLYWWRG

IRYASAQRFQAPQTPATHKAVRNATEYGPICWPASEGTNTTKGLPPPSNSSSSAPQKQAS 

EDCLFLNVVAPAGSCEGDNLPVLVYIHGGGYAFGDASTGSDFAAFTKHTGTKMVVVNLQY

RLGSFGFLAGQAMKDYGVTNAGLLDQQFALQWVQQHVSKFGGNPDHVTIWGESAGAGSVM

NQIIANGGNTVKALGLKKPLFHAAIGSSVFLPYQAKYNSPFAELLYSQLVSATNCTKAAS 

SFACLEAVDAAALAAAGVKNSAAFPFGFWSYVPVVDGTFLTERASLLLAKGKKNLNGNLF 

TGINNLDEGFIFTDATIQNDTISDQSQRVSQFDRLLAGLFPYITSEERQAVAKQYPISDA 

PSKGNTFSRISAVIADSTFVCPTYWTAEAFGSSAHKGLFDYAPAHHATDNSYYIGSIWNG 

KKSVSSVQSFDGALGGFIETFNPNNNAANKTINPYWPTFDSGKQLLFNTTTRDTLSPADP 

RIVETSSLTDFGTSQKTKCDFWRGSISVNAGL*

FIG. 10 



Amino acid composition from 1 to 572
TRN 2-1738 RHODOSPORIDIUM ESTERASE cDNA

               Total   Percent
A                 67      11.7
C                  7       1.2
D                 25       4.4
E                 16       2.8
F                 35       6.1
G                 49       8.6
H                  9       1.6
I                 21       3.7
K                 25       4.4
L                 48       8.4
M                  4       0.7
N                 35       6.1
P                 31       5.4
Q                 26       4.5
R                 16       2.8
S                 52       9.1
T                 43       7.5
V                 32       5.6
W                 10       1.7
Y                 21       3.7

Acidic            41       7.2
Basic             41       7.2
Charged           82      14.3
Net charge         0       0.0
Hydrophobic      136      23.8

Residues 572
MW 61334
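
The summary rows of the composition table can be re-derived from the per-residue counts. The sketch below is an illustrative cross-check only. The groupings are assumptions inferred from the printed totals: acidic as D + E, basic as K + R, and hydrophobic as F + I + L + V, which is the combination that reproduces the table's figure of 136. The molecular weight is not recomputed here.

```python
# Per-residue counts as printed in the composition table.
counts = {
    "A": 67, "C": 7, "D": 25, "E": 16, "F": 35, "G": 49, "H": 9,
    "I": 21, "K": 25, "L": 48, "M": 4, "N": 35, "P": 31, "Q": 26,
    "R": 16, "S": 52, "T": 43, "V": 32, "W": 10, "Y": 21,
}

total = sum(counts.values())
percent = {aa: round(100 * n / total, 1) for aa, n in counts.items()}

# Assumed groupings; see the note above.
acidic = counts["D"] + counts["E"]
basic = counts["K"] + counts["R"]
charged = acidic + basic
net_charge = basic - acidic
hydrophobic = sum(counts[aa] for aa in "FILV")

print("Residues:", total)
print("Acidic:", acidic, "Basic:", basic, "Charged:", charged)
print("Net charge:", net_charge, "Hydrophobic:", hydrophobic)
```

The counts sum to the stated 572 residues, and the derived group totals agree with the printed summary rows under these assumed groupings.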